On the consistency of the minimum evolution principle of phylogenetic inference

نویسندگان

  • François Denis
  • Olivier Gascuel
چکیده

The goal of phylogenetic inference is the reconstruction of the evolutionary history of various biological entities (taxa) such as genes, proteins, viruses or species. Phylogenetic inference is of major importance in computational biology and has numerous applications ranging from the study of biodiversity to sequence analysis. Given a matrix of pairwise distances between taxa, the minimum evolution (ME) principle consists in selecting the tree whose length is minimal, where the tree length is estimated within the least-squares framework. The ME principle has been shown to be statistically consistent when using the ordinary least-squares criterion (OLS) and inconsistent with the more general weighted least-squares criterion (WLS). Unfortunately, OLS+ME inference method can provide poor results since the variances of the input data are not taken into account. Here we study a model which lies between OLS and WLS, classical in statistics and data analysis, and we prove that the ME principle is statistically consistent within this model. Our proof is inductive and relies on a time optimal recursive algorithm for estimating edge lengths. As a corollary, we obtain a di<erent and simpler proof of the consistency result for OLS+ME. ? 2002 Elsevier Science B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robustness of phylogenetic inference based on minimum evolution.

Minimum evolution is the guiding principle of an important class of distance-based phylogeny reconstruction methods, including neighbor-joining (NJ), which is the most cited tree inference algorithm to date. The minimum evolution principle involves searching for the tree with minimum length, where the length is estimated using various least-squares criteria. Since evolutionary distances cannot ...

متن کامل

Consistency, characters, and the likelihood of correct phylogenetic inference.

Computer simulations of character-state evolution in 8, 16, 32, and 64 ingroup taxa with a known set of relationships demonstrate that the maximum probability of correct phylogenetic inference increases with the number of variable (or informative) characters and their consistency index and decreases with the number of taxa, when the consistency index has been standardized to eliminate its depen...

متن کامل

The Minimum Evolution Distance-based Approach to Phylogenetic Inference

Distance algorithms remain among the most popular for reconstructing phylogenies, especially for researchers faced with data sets with large numbers of taxa. Distance algorithms are much faster in practice than character or likelihood algorithms, and least-squares algorithms produce trees that have several desirable statistical properties. The fast Neighbor Joining heuristic has proven to be qu...

متن کامل

Comparative Phylogenetic Perspectives on the Evolutionary Relationships in the Brine Shrimp Artemia Leach, 1819 (Crustacea: Anostraca) Based on Secondary Structure of ITS1 Gene

This is the first study on phylogenetic relationships in the genus Artemia Leach, 1819 using the pattern and sequence of secondary structures of internal transcribed spacer 1 (ITS1). Significant intraspecific variation in the secondary structure of ITS1 rRNA was found in Artemia tibetiana. In the phylogenetic tree based on joined primary and secondary structure sequences, Artemia urmiana and pa...

متن کامل

Double Fuzzy Implications-Based Restriction Inference Algorithm

The main condition of the differently implicational inferencealgorithm is reconsidered from a contrary direction, which motivatesa new fuzzy inference strategy, called the double fuzzyimplications-based restriction inference algorithm. New restrictioninference principle is proposed, which improves the principle of thefull implication restriction inference algorithm. Furthermore,focusing on the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 127  شماره 

صفحات  -

تاریخ انتشار 2003